Heterogeneous Explore-Exploit Strategies on Multi-Star Networks

نویسندگان

چکیده

We investigate the benefits of heterogeneity in multi-agent explore-exploit decision making where goal agents is to maximize cumulative group reward. To do so we study a class distributed stochastic bandit problems which communicate over multi-star network and make sequential choices among options same uncertain environment. Typically, problems, use homogeneous decision-making strategies. However, performance can be improved by incorporating into make, especially when graph irregular, i.e., have different numbers neighbors. design analyze new heterogeneous strategies, using as model irregular graph. The key idea enable center more exploring than they would strategy, means providing useful data peripheral agents. In case all broadcast their reward values neighbors with probability, provide theoretical guarantees that improves under proposed strategies compared numerical simulations illustrate our results validate bounds.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Explore or Exploit? Effective Strategies for Disambiguating Large Databases

Data ambiguity is inherent in applications such as data integration, location-based services, and sensor monitoring. In many situations, it is possible to “clean”, or remove, ambiguities from these databases. For example, the GPS location of a user is inexact due to measurement errors, but context information (e.g., what a user is doing) can be used to reduce the imprecision of the location val...

متن کامل

Evaluation of Explore-Exploit Policies in Multi-result Ranking Systems

We analyze the problem of using Explore-Exploit techniques to improve precision in multi-result ranking systems such as web search, query autocompletion and news recommendation. Adopting an exploration policy directly online, without understanding its impact on the production system, may have unwanted consequences the system may sustain large losses, create user dissatisfaction, or collect expl...

متن کامل

Explore vs. Exploit: Task Allocation for Multi-robot Foraging

This paper describes two measures of performance that can be used to allocate tasks in a multirobot foraging problem. These heuristics can be used by a human supervisor or an automated control algorithm to adjust the number of robots exploring the environment versus the number of robots greedily harvesting based on current knowledge. A numerical simulation study is presented that offers prelimi...

متن کامل

Explore, Excogitate, Exploit: Component Mining

nents? In a panel at TOOLS USA 1994, Eric Aranow nicely outlined the basic question: “Is it nature or nurture?” In other words, are components born— devised from the start as components— or are components made? That is to say, have they evolved from program elements that may have been originally built for other purposes? Although some panel members argued for the nature view, it was clear to ev...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Control Systems Letters

سال: 2021

ISSN: ['2475-1456']

DOI: https://doi.org/10.1109/lcsys.2020.3042459